ON THE SMALL BALL INEQUALITY IN ALL DIMENSIONS 



DMITRIY BILYK, MICHAEL T. LACEY, AND ARMEN VAGHARSHAKYAN 

Abstract. Let Hr denote an L°° normalized Haar function adapted to a dyadic rectangle 
R c [0, lY- We show that for choices of coefficients a{R), we have the following lower 
bound on the L°° norms of the sums of such functions, where the sum is over rectangles of 
a fixed volume: 

n"^"''! a{R)hR{x) ^ > 2"" for some < ;] < - . 

|R|=2-" ^ ^ \R\=2-" 

The point of interest is the dependence upon the logarithm of the volume of the rectangles. 
With n^''~'^y^ on the left above, the inequality is trivial, while it is conjectured that the inequal- 
ity holds with n'''"^^^^. This is known in the case of d - 2(Talagrand, 1994), and a recent paper 
of two of the authors (Bilyk and Lacey, 2006) proves a partial result towards the conjecture 
in three dimensions. In this paper, we show that the argument of (Bilyk and Lacey, 2006) 
can be extended to arbitrary dimension. We also prove related results in the subjects of the 
Irregularity of Distribution, and Approximation Theory. The authors are unaware of any 
prior results on these questions in any dimension d > 4. 



1. The Small Ball Conjectures 

In this paper we will prove results in dimension four and higher in three separate 
areas. Number Theory, Approximation Theory, and Probability Theory: (a) the theory 
of Irregularities of Distribution, (b) the Kolmogorov Entropy of spaces of functions with 
bounded mixed derivative, and (c) Small Deviation Inequalities for the Brownian Sheet. 
As far as the authors are aware, these are the first results on these questions which provide 
more information than that given by an average case analysis. Underlying these three 
results is a central inequality, the Small Ball Inequality for the Haar functions, which we 
state here. The related areas are addressed in the next section. 

In one dimension, the class of dyadic intervals is !D := {[j2'^, {j + 1)2'^) : j,k e Z}. Each 
dyadic interval has a left and right half, indicated below, which are also dyadic. Define 
the Haar functions 

Note that this is an L°° normalization of these functions, which we will keep throughout 
this paper. 

In dimension d, a dyadic rectangle is a product of dyadic intervals, thus an element of 

T)^. We define a Haar function associated to R to be the product of the Haar functions 
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associated with each side of R, namely 



/=1 

This is the usual 'tensor' definition. 

We will concentrate on rectangles with fixed volume and consider a local problem. This 
is the 'hyperbolic' assumption, that pervades the subject. Our concern is the following 
Theorem and Conjecture concerning a lower bound on the L°° norm of sums of hyperbolic 
Haar functions: 

Small Ball Conjecture 1.1. For dimension d >3we have the inequality 
(1.2) 2-" Yj ^ n^^'^-^^W Yj «(^)^« 



|Ri=2-» 



|R|>2-» 



Average case analysis — that is passing through — shows that we always have 



2-" Yj ^ "^^'^"^^1 Y ^^^^^^ 



|R[=2-» 



|R|>2-» 



Namely, the constant on the right is bigger than in the conjecture by a factor of yfn. We 
refer to this as the 'average case estimate,' and refer to improvements over this as a 'gain 
over the average case estimate.' Random choices of coefficients a{R) show that the Small 
Ball Conjecture is sharp. 

In dimension d = 2, the Conjecture was resolved by (Talagrand, 1994).^ 

Talagrand's Theorem 1.3. For dimensions d >2,we have 
(1.4) 2-" Y ^ II Z 

|R|=2-» |R|=2-« 

Here, the sum on the right is taken over all rectangles with area at least 2~". 

The main result of this note is the next Theorem, which shows that there is a gain over 
the trivial bound in the Small Ball Conjecture in dimensions d > 3. In dimension d = 3, this 
result was proved in (Bilyk and Lacey, 2006). The three-dimensional result and its present 
extension build upon the method devised by (Beck, 1989). As far as the authors are aware, 
this is the first 'gain over the average case bound' known in dimensions four and higher. 

Theorem 1.5. In dimension d > 3, there exists a number r](d) > such that for all choices of 
coefficients a{R), we have the inequality 

I Y cc(R)hR ^ 2"" Y • 

iR|>2-» °° |R|=2-« 



(1.6) n 



^This result should be compared to (Schmidt, 1972), as well as (Halasz, 1981; Temlyakov, 1995). 



SMALL BALL INEQUALITY IN ALL DIMENSIONS 



3 



We take this Theorem as basic to our study, and use its proof to derive results on the 
three other questions mentioned at the beginning of the introduction. 

The principal difficulty in three and higher dimensions is that two dyadic rectangles 
of the same volume can share a common side length. Beck (Beck, 1989) found a specific 
estimate in this case, an estimate that is extended in (Bilyk and Lacey, 2006). In this note, 
the main technical device is the extension of this estimate, in the simplest instance, to 
arbitrary dimensions, see Lemma 5.2. This Lemma, and its extension to longer products 
Theorem 8.3, is the main technical innovation of this paper. The value of i] that we can get 
out of this line of reasoning appears to be of the order d~^, imputing additional interest to 
the methods of proof used to improve this estimate. Indeed, many aspects of our analysis 
are suboptimal, and the most essential techniques necessary to optimize the arguments of 
this paper are yet to be discovered. 

2. Related Results 

The L°° Norm of the Discrepancy Function. In d dimensions, take to be N points in 
the unit cube, and consider the Discrepancy Function 

(2.1) Dm(x) := jjyiN n [0,f) - N|[0,f)| . 

Here, [0, x) = ny=i[0, Xj), that is a rectangle with antipodal comers being and x. Relevant 
norms of this function must tend to infinity, in dimensions 2 and higher. The canonical 
result of this type is the following estimate proved in (Roth, 1954). 

K, Roth's Theorem 2.2. We have the universal estimate 

\\D^\\2>{logNY'-'^/\ 

with the implied constant only depending upon dimension. 

For all 1 < p < oo, ||D]v||p admits the same lower bound, a result in (Schmidt, 1977). 
The endpotnt estimates of p = 1, oo are however much harder, with definitive information 
known only in two dimensions. The method of proof of this Theorem, and the U variants 
can be summarized as follows: Fix 2N < 2" < N, and just project the Discrepancy Function 
onto the (hyperbolic) Haar functions {/zr : \R\ = 2""}. By the Bessel inequality, this provides 
a lower bound on the norm of D^. This same method of proof, with the Littlewood- 
Paley inequalities replacing the Bessel inequality, can be used to prove the LP lower bound, 
for 1 < p < oo. See (Beck and Chen, 1987). 

At L°°, guided by the sharpness of the Small Ball Conjecture, we pose the Conjecture 
below, which represents a ^J\ogN gain over the lower bound proved by Roth. 

The L°° Norm of Discrepancy Function Conjecture 2.3. In dimension d > 3, we have the 
lower estimate valid for all point sets ^1^. 

IIDmIU > (logN)''/^ 
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In dimension d = 2, this is the Theorem of (Schmidt, 1972). In dimension d = 3, 
(Beck, 1989; Bilyk and Lacey, 2006) give partial information about this conjecture. In this 
paper, we can prove the following result, which appears to be new in dimensions d > 4. 

Theorem 2.4. In dimension d >3 there is a positive t] = 7](rf) > Ofor which we have the uniform 
estimate 

The proof of this result follows easily from the method of proof of Theorem 1.5, and will 
be presented below. 

Metric Entropy of Mixed Derivative Sobolev Spaces. While the special structure of the 
Haar functions can be exploited to prove the Small Ball Conjecture, one would not antici- 
pate that this special structure is in fact essential to the Conjecture. Thus, we formulate a 
smooth variant of the Small Ball Conjecture. 

Fix a continuous non-constant function (p, supported on [-1/2, 1/2], and of mean zero. 
For a dyadic interval I, let 

be a translation and rescaling of (p so that it is supported on I. Then, for a dyadic rectangle 
R = R^X---xRa,set 

d 

Smooth Small Ball Conjecture 2.5. For dimension d >3we have the inequality 
(2.6) 2-" y \a{R)\ < n^^'^-Ml Y a{R)(pR . 

II oo 

|R|=2-" |R|>2-" 

The implied constant depends upon dimension d and <p only. 

In this direction, we will prove a result in the same spirit as our Main Theorem. 

Theorem 2.7. Suppose (p is continuous, supported on [-1/2,1/2], of mean zero, and such that 
{(p,/i[_i/2,i/2]) 0. For dimension d>3, there is a positive r] = rjid) so that we have the inequality 
below 

(2.8) 2"" y \a{R)\ < n5('^-i)-''|| V a{R)(pR . 

II CO 

|R|=2-» |R|>2-" 

The implied constant depends upon <p. 

With this Theorem, we can establish new results on the metric entropy of certain Sobolev 
spaces of functions with mixed derivative in certain U spaces. In d dimensions, consider 
the map 

Intrf f{xi, ■■■ ,Xd) := I • • • I /(yi, ■■■ ,yd) dyi--- dy^. 
Jo Jo 
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We consider this as a map from LP([0, 1]'^) into C([0, 1]''). Clearly, the image of Intd consists 
of functions with U integrable mixed partial derivatives. Let us set 

Ball(MWP([0,l]^)) := IntSf e LP([0,1]^) : ||/||p < 1}). 

That is, this is the image of the unit ball of U. This is the unit ball of the space of functions 
with mixed derivative in LP. 

These sets are compact in in C([0, lY), and it is of relevance to quantify the compactness, 
through the device of covering numbers. For < e < 1, set N{e, p, d) to be the least number 
N of points Xi, • • • , Xn s C([0, 1]'') so that 

N 

V,Si\\{MW{[Q, If)) c y (x„ + eB^) . 

n=l 

Here, B^o is the unit ball of C([0, 1]''). The task at hand is to uncover the correct order 
of growth of these numbers as e J, 0. The case of = 2 below follows from Talagrand 
(Talagrand, 1994), and the upper bound is known in full generality (Dunker et al., 1998; 
Temlyakov, 1995). 

Conjecture 2.9. For d>2 one has the estimate 

logN(e,p,rf) - e-l(logl/e)^-l/^ eiO. 

It is well known (Temlyakov, 1989) that results such as Theorem 2.7 can be used to give 
new lower bounds on these covering numbers. 

Theorem 2.10. For 1 <p < oo, and d>3, there isari > Ofor which we have 

\ogN{e,p,d)>e-\\ogl/e)'-'^'^. 

We have concentrated on the case of one mixed derivative, but various results on 
fractional derivatives are also interesting. See for instance (Kiihn and Linde, 2002), and 
(Dunker, 2000). 

The Small Ball inequality for the Brownian Sheet. Perhaps, it is worthwhile to explain 
the nomenclature 'Small Ball' at this point. The name comes from the probability theory. 
Assume that Xf : T — > R is a canonical Gaussian process indexed by a set T. The Small Ball 
Problem is concerned with estimates of P(supjg^ |Xf | < e) as e goes to zero, i.e the probability 
that the random process takes values in an L°° ball of small radius. The reader is advised 
to consult a paper by Li and Shao (Li and Shao, 2001) for a survey of this type of questions. 
A particular question of interest to us deals with the Brownian Sheet, that is, a centered 
Gaussian process indexed by the points in the unit cube [0, if and characterized by the 
covariance relation EXs • Xf = nj=i min(sy, tj). 

Kuelbs and Li (Kuelbs and Li, 1993) have discovered a tight connection between the 
Small Ball probabilities and the properties of the reproducing kernel Hilbert space corre- 
sponding to the process, which in the case of the Brownian Sheet is WM^([0, if), the space 



6 



D. BILYK, M. T. LACEY, AND A. VAGHARSHAKYAN 



described in the previous subsection. Their result, applied to the setting of the Brownian 
sheet in (Dunker et al, 1998), states that 

Theorem 2.11. In dimension d > 2, as e i we have 

-logP(||B||c([o,in < e) - e-Hlogl/ef iff logN(£) - e-Hlogl/ef\ 

Thus, in agreement with Conjecture 2.9, the conjectured form of the aforementioned 
probability in this case is the following: 

The Small Ball Conjecture for the Brownian Sheet 2.12. In dimensions d > 2, for the 
Brownian Sheet B we have 

-logP(IIBIIcM) < £) - e-\logl/ef'-\ e i 0. 

In dimension d = 2, this conjecture has been resolved by Talagrand in the already cited 
paper (Talagrand, 1994), in which he actually proved Conjecture 2.5 for a specific function 
(p and used it to deduce the lower bound in the inequality above. ^ In higher dimensions, 
the upper bounds are established, see (Dunker et al., 1998), and the previously known 
lower bounds miss the conjecture by a single power of the logarithm. 

Theorem 2.10 can be translated into the following result on the Small Ball Probability 
for the Brownian Sheet: 

Theorem 2.13. In dimensions d > 3, there exists 7] > such that for the Brownian Sheet B we 
have 

-logP(||B||c([o,i]'0 < £) ^ e-Hlogl/ef'-^^\ e i 0. 

3. Notations and Littlewood-Paley Inequality 

Let r G N** be a partition of n, thus r = (ri, . . . , r^), where the rj are nonnegative integers 
and \r\ '■= Tjt'''t = which we refer to as the length of the vector r. Denote all such vectors as 
H,,. ('H' for 'hyperbolic.') For vector r let 1^,^he all dyadic rectangles R such that for each 
coordinate k, \Rk\ = 2"'"'^. 

Definition 3.1. We call a function / an r function with parameter r if 
(3.2) f=Y^ej,hR, eRG{±l}. 

ReK^ 

A fact used without further comment is that f} = 1. 

J r 

As it has been already pointed out, the principal difficulty in three and higher dimensions 
is that the product of Haar functions is not necessarily a Haar function. On this point, we 
have the following 

The work of Talagrand bears strong similarities to the prior work of (Schmidt, 1972) and (Halasz, 1981). 
The argument of Talagrand was subsequently clarified by (Temlyakov, 1995), and (Dunker, 2000). 
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Proposition 3.3. Suppose that R^,. . .,Rk are rectangles such that there is no choice o/l < ; < 
y < k and no choice of coordinate 1 < t < dfor which we have Rj^t = R^^f Then, for a choice of 
sign E e {±1} we have 

k k 

(3.4) P]^R = £/zs, S = P|J^,. 

7=1 7=1 

Proof. Expand the product as 

{ id 

/lR„,(Xi, . . . , Xd) = PI PI ^R,m(^0 • 
m=l m=l f=l 

Here e„, G {±11. Our assumption is that for each t, there is exactly one choice of 1 < mo < 
such that Rmo,t = Sf. And moreover, since the minimum value of \Rm,t\ is obtained exactly 
once, for m + OTq, we have that /zr„,( is constant on Sf Thus, in the t coordinate, the product 
is 

This proves our Lemma. □ 

Remark 3.5. It is also a useful observation, that the product of Haar functions will have 
mean zero if the minimum value of ^m,t\ is unique for at least one coordinate t. 

Definition 3.6. For vectors r,- G N'', say that ri, . . . , rj are strongly distinct iff for coordinates 
1 < t < d the integers {r^ t : 1 < j < }} are distinct. The product of strongly distinct r 
functions is also an r function, which follows from 'the product rule' (3.3). 

The r functions we are interested in are: 
(3.7) f,^:=Y^sgn{a{R))hR. 

We recall some Littlewood-Paley inequalities, which are standard, and so we omit proofs. 
Littlewood-Paley Inequalities 3.8. In one dimension, we have the inequalities 

llr 1^^^ 

(3.9) 12], aM-) ^ VP X «7 li(-) ' 2 < p < oo . 

7cR ^ " 7cR V 

Moreover, these inequalities continue to hold in the case where the coefficients aj take values in a 
Hilbert space 'H. 

The growth of the constant is essential for us, in particular the factor is, up to a 
constant, the best possible in this inequality. See (Fefferman andPipher, 1997; Wang, 1991). 
That these inequalities hold for Hilbert space valued sums is imperative for applications 
to higher dimensional sums of Haar functions. The relevant inequality is as follows. 
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Theorem 3.10. We have the inequalities below for hyperbolic sums of r functions in dimension 
d>3. 



(3.11) WYj/rW ^{Pnf'^^'^r l<p<00. 



k1=n 

We recall a vector valued Harmonic Analysis inequality. 

Proposition 3.12. Let fj he a sigma field generated by dyadic rectangles in dimension 2. We then 
have 

-.1/2 r 



(3.13) 



I^E(<p,l^)' 



< 



2<p< 



oo . 



Proof This is one of many examples of a vector valued inequality in the Harmonic Analysis 
literature. This particular inequality admits a simple proof by duality recalled here for 
convenience. 

Since p > 2, we can appeal to a duality argument. We can choose g e L^P/^y of norm one 
so that 

YEicpjirjf -Vmi<Pi\rj)\g} 
<Y,m(p]\rj),g} 

i 

<Y,{cp]Mg) 



< 



p/2 



l|M^||(p/2)' 



^ ((P/2)' - 1)- 



p/2 



Here we have used Jensen's inequality and the self-duality of the conditional expectation 
operators. The operator M ^ is the (strong) maximal function on the plane, namely 



M g{x) = sup 



where the supremum is over all dyadic rectangles R. This maps U into U for all 1 < < oo, 
an inequality appealed to in the last line of the display above. Moreover, it is well known 
that the norm of the operator behaves as 

\\M\\,^,<{q-l)-\ l<q<2. 
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□ 

4. Proof of Theorem 1.5 

The proof of the Theorem is by duality, namely we construct a function W of O norm 
about one, which is used to provide a lower bound on the L°° norm of the sum of Haar 
functions. The details of this argument are similar to those of (Bilyk and Lacey, 2006). 

The function W will take the form of a Riesz product, but in order to construct it, we 
need some definitions. Fix < e < 1 to be a small number, ultimately of order l/cf. Define 
relevant parameters by 

(4.1) q = lan'j, h=\, 

(4.2) p = aq'n-^'-'^/' , p = ^n-^'-^^l\ 

Here a is a small positive constant, we use the notation h = \ throughout, so as not to 
obscure those aspects of the argument that dictate this choice, p'is a 'false' normalization 
for the sums we consider, while the larger term p is the 'true' normalization. Our 'gain 
over the average case estimate' in the Small Ball Conjecture is — n'^^. 

Divide the integers {1, 2, ... , n} into q disjoint intervals of equal length h,..., Iq, ordered 
from smallest to largest. Let At := {re ]H„ : ri G It). Let 

(4.3) Ft:=Y^ff, H„:= a{R)hj,. 

re At \R\=2-" 

Here, the fj^are as in (3.7). The Riesz product is a 'short product': 

i=l 

One can view the p?f as a 'poor man's sgn(Ff)', in that the Riesz product above tends to 
weight the region where the functions Ft align. Note the subtle way in which the false 
normalization enters into the product. It means that the product is, with high probability, 
positive. And of course, for a positive function f , we have Ef = ||F||i, with expectations 
being typically easier to estimate. This heuristic is made precise below. 
Proposition 3.3 suggests that we should decompose the product W into 

(4.4) W = 1 + W^ + , 

where the two pieces are the 'strongly distinct' and 'not strongly distinct' pieces. To be 
specific, for integers 1 < u < q,let 

l<v^<-<vt<q ?ieA„ t=l 
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Lsd ^ 
is taken to be over all ft G A,,, 1 <m <k such that: 

(4.5) the vectors {vt : 1 < m < k} are strongly distinct. 
Then define 

(4.6) := Yj • 

k=l 

With this definition, it is clear that we have 

(4.7) <H„, vf/sd^ ^ ^^^^ vpsd^ ^^b,^-^-^ ,2-" ' 

|R|=2-" 

SO that is our 'gain over the trivial estimate', once we prove that < 1 (estimate 

(4.14) below). Proving this inequality is the main goal of the technical estimates of the 
following Lemma: 

Lemma 4.8. We have these estimates: 

(4.9) P(^ < 0) < expi-Aq^-^'') ; 

(4.10) im|2<exp(flV'); 

(4.11) = 1 ; 

(4.12) imii<l; 

(4.13) ||W1li<l; 

(4.14) ll^^'^lli < 1 . 

Here, < a' < 1, in (4.10), is a small constant, decreasing to zero as a in (4.1) goes to zero; and 
A > 1, in (4.9) is a large constant, tending to infinity as a in (4.1) goes to zero. 

Proof. We give the proof of the Lemma, assuming our main inequalities proved in the 
subsequent sections. 

Proof of (4.9). Using the distributional estimate (6.3) of Theorem 6.1 proved in Section 5, 
and the definition of we estimate 

P(W <G)<Y IP(P f f < -1) 
t=i 
</ 

=2],F(pff <-fl-v/2-f) 

< exY>{-ca-'^q^-'^^) . 

Proof of (4.10). The proof of this is detailed enough and uses the results of subsequent 
sections, so we postpone it to Section 6, Lemma 6.5 below. 
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Proof of (4.11). Expand the product in the definition of The leading term is one. Every 
other term is a product 



keV 



where V is a non-empty subset of {1, ... , q}. This product is in turn a linear combination of 
products of r functions. Among each such product, the maximum in the first coordinate 
is unique. This fact tells us that the expectation of these products of r functions is zero. So 
the expectation of the product above is zero. The proof is complete. 

Proof of (4.12). We use the first two estimates of our Lemma. Observe that 

II Will = - 2EWlvi;<o 

< 1 + 2F{W < 0)^'^n2 

We have taken b = 1/4 so that 1 -2b — lb. For sufficiently small a in (4.1), we will have 
A > fl'. We see that (4.12) holds. 

Indeed, Lemma 6.5 proves a uniform estimate, namely 

sup E P|(l + pFtf < expia'q^^) . 

Vc{l,...,q] 

Hence, the argument above proves 
(4.15) sup ||r[(l + pfO <1. 

Proof of (4.13). The primary facts are (4.15) and Theorem 8.3; we use the notation devised 
for that Theorem. 

We use the triangle inequality, estimate (4.10) of Lemma 4.8, Holder's inequality, with 
indices q^^ and (cj^'^^ = q^^ l{q^^ - 1) , the inclusion-exclusion identity (8.2) and estimate (8.4) 
of Theorem 8.3 in the calculation below. Notice that we have 

x|j](l + pff) 



sup n^i + pf , < sup n^i+pff) 

^C(l qY^%^ W ) Veil <,)"tgi5- 

< 1. 

We now estimate 
(4.16) ||vi/-||i< 



teV 



p SumProd(X(G)) 



G admissible 



^ E 

dmissi 

I 



lip SumProd(X(G))||, 



G admissible 



n (1 + p^*) 

te[l,...,q]-V(G) 

te[l,...,q]-V{G) 



||p'^^^^iSumProd(X(G))|L 



G admissible 
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= L Z llp"'^''^'SumProd(X(G))||,.. 

v=2 G:\V(G)\=v 

< q^"n~'^ < n~'' < 1 . 

Proof of (4.14). This follows from (4.13) and (4.12), and the identity W = 1 + ^^'^ + 
together with the triangle inequality. 

□ 

5. The Analysis of the Coincidence 

Following the language of J. Beck (Beck, 1989), a coincidence occurs if we have two vectors 
r s*with e. g. r2 = S2. He observed that sums over products of r functions in which there 
are coincidences obey favorable L-^ estimates. We refer to (extensions of) this observation 
as the Beck Gain. We introduce relevant notation for this situation. For 1 < k < d and 
1 < h, tz < q, set 

(5.1) 0,,,,;,:= Yj fr-fs- 
re Afj ; sfe 

r^s 

rk=Sk 

Notice that due to our construction of the Riesz Product, there are no coincidences in the 
first coordinate in the decomposition of although the case = 1 is important for the 
proof of the estimate (4.10) . In the sum above, there are 2d - 3 free parameters among 
the vectors r and s. That is, the pair of vectors (r, s) are completely specified by their values 
in 2rf - 3 coordinates. The following lemma suggests that these parameters behave as if 
they were orthogonal. 

The Simplest Instance of the Beck Gain 5.2. We have the estimates below, valid for an absolute 
implied constant that is only a function of dimension d>3. 

(5.3) sup \\(^n,tj\p < V'-^'^ ■ n'-^l^ , 2 < p < oo , 

where the supremum is taken over alll <k < d and 1 < ti, t2 < q. 

This estimate is smaller by 1/2 power of n than what one might naively expect, and 
so we say that we have an average gain of 1/4 power of n in the products above. (Here, 
the average is in reference to the two functions we form the product of.) This Lemma, in 
dimension d = 3 appears in (Bilyk and Lacey, 2006). We will give an inductive proof of 
this estimate, that requires that we revisit the three dimensional case. In the next section, 
we also derive other estimates from the one above. 

The estimate above may admit an improvement, in that the power of p is perhaps too 
large by a single power, due to our use of Proposition 3.12. (There should also be a 
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dependence upon q, but on this point, and in many others, the arguments of this paper are 
suboptimal, and so we do not pursue this point here.) 

Conjecture 5.4. We have the estimates below, valid for an absolute implied constant that is only a 
function of dimension d>3. 

(5.5) sup \\Ot^,tJ\p < {pnf-^l^ , 2 < p < oo . 

Proof of Lemma 5.2. The proof is inductive on dimension. We shall suppress dependence 
on ti,t2. In fact, we shall prove the Theorem for the quantity 

(5.6) Oi:= fr-fs> 

ri=si 

and the claimed statement will follow with only minor adjustments. To set up the induc- 
tion, we need some definitions. 

Definition 5.7. Given a set of r functions {f^} and subset C c H^^ x • • • x H,,,, set 

t 

SumProd(C) = W_k 

(n n)e€ s=i 

Below, we will be interested in pairs and four-tuples of r functions. It is an important 
element of the argument, allowing us to run the induction, that we consider products of r 
functions where the vectors are in hyperbolic collections ]H„, for different values of n. 

The main quantity we induct on is then 
(5.8) S{d, n, p) = sup||SumProd(B)||p , d,n,p>3. 

B 

Here, the supremum is formed over all B c ]H„j x H^^ and all r functions subject to these 
conditions: 

• There is a coincidence in the first coordinate: For all (r, s) G B, we have r s*and 
ri = si. 

• Hi, n2 < n. That is the lengths of the vectors r and s*are permitted to be different. 

• No other restriction is placed upon the pairs of vectors in B. 
Our main estimate on these quantities is as follows. 

Lemma 5.9. We have the inequality below valid for all dimensions d>3. 
Bid, n, p) < p'^-i/2„rf-3/2 ^ p, n > 3 . 

The inductive argument for Lemma 5.9 has the underlying strategy of reducing dimen- 
sion by application of the Littlewood-Paley inequalities. But, this causes the collections of 
vectors to lose some of their symmetry. Regaining the symmetry causes us to introduce 
additional types of collections of vectors. Two of these collections are as follows. 

(5.10) C(rf,n,p) = sup||SumProd(C)||p, d,n,p>3. 

c 
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Here, the supremum is formed over all C c ]H„j x H^^ and all r functions subject to these 
conditions 

• There is a coincidence in the first coordinate: For all {r,s) e C, we have r ^ s*and 
ri = si. 

• For all (r, s) G C, we have r2 > S2 and < S3. 

• ni,n2 < n. 

• There is no other restriction on the pairs of vectors in C. 

The only difference between the present collections and the collections in !B{d, n, p) is 
that in the present collections we assume locations of maximums in the second and third 
coordinates, thereby permitting application of the Littlewood-Paley inequalities in those 
two coordinates. 

The second collection is less sophisticated. We simply assume that the maximum always 
occurs in say, the first coordinate. Define 

(5.11) V{d, n, p) = sup||SumProd(D)||p , d,n,p>3. 

D 

Here, the supremum is formed over all D c H^^ x H^^ and all r functions subject to these 
conditions 

• There is a coincidence in the first coordinate: For all (r,s) G D, we have r s*and 
ri = si. 

• For all (r, s) G D, and all 2 < j < d, we have Vj > Sj. 

• n2 < Ui < n. 

That is, we require that in each coordinate where there is a maximum, the maximum occurs 
in the vector r. 

Lemma 5.12. ]Ne have the inequality below valid for all dimensions d>3. 

C{d, n, p) , V{d, n, p) < p'^-^'^ ■ n'^-^'^ , p,n>3. 

We turn to the proofs of the Lemma 5.9 and Lemma 5.12, and begin by explaining the 
logic of our induction. Let !B{d) stand for the inequalities in Lemma 5.9 in dimension d, 
and likewise for C{d) and !D{d). We prove: 

• The inequalities !D{d) for all dimensions d. 

• The inequalities !B{3) and C(3). At the same time, assuming !B{d - 1), d > 4, we 
prove C{d). 

• Assuming C(rf) and D{d), we prove !B{d). 

These clearly combine to prove the two Lemmas, and so complete the proof of Lemma 5.2. 

The Inequalities D{d). The definition of D{d) permits the possibility of equality for a 
large number of coordinates of the two vectors. Let us exclude that case in this definition. 
Define 

(5.13) £)5t(d, n,p) = sup||SumProd(D)||p, d,n,p>3, 

D 
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where D is as in (5.11), but with the additional condition that for 2 < j < d we have Vj > Sj. 
Then, we are free to apply the Littlewood-Paley inequality in each of the coordinates from 
2 to d. 

Fix a collection of vectors O, and a collection of r functions which achieves the supremum 
in (5.13). For this collection, and a choice of vector p G N'*"^, let 

Dp^ = {(r^s) G D : r^+i = py, 1 < ; < d - 1} . 

Of course there are at most < n'^~^ values of p for which the collection above is non-empty. 
Then, 



{d-l)/2 



SumProd(D^3>)2 



<p(rf-l)/2^(rf-l)/2g^p 



SumProd(Dp-) 

But, the coordinate ri is completely specified in Dp-, and therefore does not contribute to 
the last norm. And so the first coordinate of s*is specified. Therefore, there are at most 
d - 2 free choices of parameters in the vector s. By application of the Littlewood-Paley 
inequalities, we have 

This is better than the claimed inequality. 

If there are a set / c {2,...,d} of coordinates for which rj = Sj for all / G /, then after 
arbitrarily specifying these values, we have will be in position to apply the inequality 
!D^{d - I/I, n, p). This will clearly give a smaller estimate. As the number of possible choices 
for / is only a function of dimension, this completes the proof. 

The Bounds S(3) and C(3). Assuming !B{d - 1), d > 4, we prove C{d). In this section, we 
will prove the estimates for C(3). As well, we present the inductive proof of C{d) assuming 
S{d - 1), for d>4. 

For the proof of C(3) there is an ancillary collection that we will have recourse to. Let 
(5.14) M(n,p) = sup||SumProd(M)||p 

M 

where the supremum is formed over all choices of M c H^^ x H^^ and all r functions 
subject to these conditions. 

• r, sare three dimensional vectors. 

• There is a coincidence in the first coordinate: For all (r, s) G C, we have r ^ s^and 
ri = si. 

• The second coordinates are fixed: There are integers Fi,F2 so that for all (r^s) G M 
we have r2 = Fi and S2 = F2. 

• There is no coincidence in the third coordinate: For all (r,s) G M we have S3. 
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• — 




— • 











ni 



Fixed Values 



lengths of the vectors 



Figure 1. The collections M, with a coincidence in the top row, the second 
row taking fixed values, and no coincidence in the bottom row. 



• ni,n2 < n. 

See Figure 1 for an illustration of this collection. We remark that in the case n\ na, a 
coincidence can occur in the third coordinate, a case that will come up below. 

Lemma 5.15. ]Ne have the inequalities 

(5.16) M{n,-p) < ^ - yfn. 

Proof. Notice that the value of the maximum in the third coordinate completely specifies 
the pair of vectors (r,s). Therefore, one application of the Littlewood-Paley inequalities 
completes the proof. For any collection M as above, let Ma be the (r, s) G M where the 
maximum in the third coordinate is a, maxjrs, S3} = a. Note that this can only consist, at 
most, of two pairs of vectors. 

1/2 



) SumProd(M„)2 < VP" 



□ 



Fix a dimension d > 3. Let B be the collection which satisfies the conditions associated 
with (5.8) that contains C. We introduce a conditional expectation into the argument, to 
gain some additional symmetry. Let Ta,b be the dyadic sigma field in the second and third 
coordinates generated by dyadic rectangles of side lengths 2"""^ and 2"^"^ respectively. 

We have this equality. 

(5.17) frfs = ^ X frMra,b)-SnmFrod{n,b\ 



(r,s)eC 
r2=a ,S3=b 



(r,s)eB 
r2=a ,S3=b 



where JDa,b consists of pairs of vectors (f*, s) G B such that ri = Si, a = r2 = S2 and b = = S3. 
In three dimensions, the set 'Da,b is empty, since the requirements for a pair of vectors being 
in the set Da,b forces r = s*, a contradiction. 
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Assuming that d > 3, using the assumption of !B{d - 2) ( in the case of = 4 we just apply 
the Littlewood-Paley inequality in the last coordinate), we see that 

(5.18) ||SumProd(D„,,)||p/2 < p'"'^' • n'-'^^ . 

Here, we have 'lost two dimensions' due to the roles of a, b. Therefore, using a trivial 
estimate in the parameters a, b, 

-.1/2 



SumProd(D„,fo)^ 



a,b 



^ pd-3/2^d-5/2 



This estimate is smaller than what the other terms will give us. 
Therefore, using (3.13) we can estimate 



(5.19) \\SumFrod{C)\\p < p'-"'n'-'/' + Yu ^^'^^ 



a,b {r,s)eB 



1/2 



p/2 



We concentrate on the latter term, and in particular expand the square. 



(5.20) Y L /^-/^ 



a,b {r,s)eB 
r2=a ,S3=b 



< n 



2d-3 



(5.21) + SumProd(B;) + SumProd(B2) 

(5.22) + SumProd(]B") 

where these terms arise as follows. In forming the square on the left in (5.20), we have two 
pairs (r, s), {r,s) eC with = r'^ and S3 = S3. We form the product 

(5.23) frfrfrfi 

• If the two pairs are equal, the product in (5.23) is one. There are < n^'^~^ ways to 
select such pairs. This is the right hand side of (5.20). 

• The collection B'^ consists of vectors such that r = f but s^s, the product in (5.23) is 
equal to fy- ft (B2 is defined symmetrically). Notice that necessarily we have Si = s^, 
which is equal to Vi, and S3 = S3. Let us set 

B^ = {{s,s) : si = s^ = c; S3 = S3} . 

We have 'lost' one parameter in B^ and have one more coincidence, therefore, we 
can apply the induction hypothesis !B{d - 1) to see that 

||SumProd(B^)||p < p^-y^n^-^n _ 

It is easy to see that 

SumProd(B;) = Y SumProd(B;j). 
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Wi n2 ^2 Hi <-—__^^^ lengths of the vectors 

^^^.....^^^^^kisMe ^ — ^ 
outside 

Figure 2. The Decomposition of B^'^ in the four dimensional case. Note 
that the coincidences are indicated by the connected black circles. 

Thus we have 

||SumProd(B;)|| < Yj ||SumProd(B;^)|| < n^"^ • p'^-3/2„rf-5/2 ^ ^d-3/2^d-7/2 _ 

This controls the term in (5.21). 
• The last term arises from two pairs of vectors (r, s), (f, s) G C that consist of four 
distinct vectors. Let us set 

B" = {{r'J,i,?) : (r-:s-),(f,DGC,r"^f, s'^H 

Here, for the sake of cleaner graphics, we have deliberately written s, fas the middle 
two vectors in the four- tuples in B". 

It remains to bound the term in (5.22). We reduce this four-fold product back to a product 
of two-fold products. For integers Fi,F2, let B^'^ be those (r,s*,f,r) G B" with ri = Si = fi 
and = = F2. Let 15^'^^^^^^^^ be the projection of four-tuples in B^'^ onto the first and 
fourth coordinates, and ^'^g^deFiFi projection onto the second and third coordinates. 
See Figure 2. 

For any pair {r,r) G IB"^^^.^^^^ and any two pairs 

(S'l), {0,q)^^'Lide,F^,F,' 

we have 



(r, s, |, r) , (r, o,o,r)e B"^ . 
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Therefore, we have the product formula 

SumProd(B;.; = SumProd(B;;^ X SumProdCB:;^^,^ ,^ . 

Notice that the pairs of vectors in B^'^^^. have their first coordinates fixed, and have 
a coincidence in the second coordinate. The fixed first coordinates need not be the same, 
so that the lengths of the remaining coordinates are, in general, distinct. Still, we may 
conclude that 

||SumProd(B;;,3,,^^,^^,;i|p < /-^'^n'-^l' . 

This estimate is uniform in Fi,F2. In the case of dimension d = 3, this follows from 
Lemma 5.15, while for > 3 it follows from the induction hypothesis. A similar inequality 
holds for B" . , ^ ^ . 



Therefore, we can estimate the term in (5.22) as follows: 

1/2 



SumProd(B") < pn supllSumProdCB;;,^.,^ ,^ X SumProdCBf;^.,^ ,^ ,^)||;;^ 



< pnsup||SumProd(B;;,^,,^^,^^,;||;/^ x l|SumProd(B:; .,^,^^,^)||;/^ 

rf-3/2 



^ ipny 

Our proof is complete. Assuming !B{d - 1), d > 4, we have proved C(d). We have also 
proved C(3). The fact that !B{3) holds follows from the argument below. 

Assuming C{d) and !D{d), we prove !B{d). Fix p, n > 3, a collection of vectors B and r 
functions which achieve the supremum in (5.8). Write this collection as 

B = DU y 

where C, y consists of those pairs (r^ s) G B such that i is the first coordinate for which > s, 
and j is the first coordinate for which rj < Sj. Then, the collections Cj y are pairwise disjoint, 
and the collection D consists of all pairs not in some Cij. Thus, 

SumProd(B) = SumProd(D) + Yj SumProd(C/,y) . 

2<i^j<d 

After a harmless permutation of indices, the inequalities C(rf) apply to the collections 
Cij. The (unconditional) inequalities D apply to the collection D. The proof is complete. 

6. Corollaries of the Beck Gain 

Theorem 3.10 implies an exponential estimate of order exp(L^/('*"^^) for sums of r func- 
tions. In fact, we can derive a subgaussian estimate for such sums, for moderate deviations, 
and moreover, in order to have a gain of order rf^'^ in our Main Theorem, we need to use 
this estimate. 
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Theorem 6.1. Using the notation of (4.2) and (4.3), we have this estimate, valid for all 1 < t < q. 

(6.2) WpFtWp < Vp / 1 < p < cn^S . 

As a consequence, we have the distributional estimate 

(6.3) P(|pff| > x) < exp(-cx^), x<cn^. 
Here < c <1 isan absolute constant. 

To use (6.3), we need q'^ = a^n^'^ < cn^, and so e ^ is the optimal value for e that 
this proof will give. 

Proof. Recall that 

reAt 

where Af := {r G H,, : ri G It], and Jf in an interval of integers of length njq, so that 
Apply the Littlewood-Paley inequality in the first coordinate. This results in the estimate 

selj r:ri=s 

< vpih+p'^wii;;' 

< vp{i+iip'^wii;;2)' 

where Of f i is defined in (5.1). Here it is important to use the constants in the Littlewood- 
Paley inequalities that give the correct order of growth of ^fp. Of course the terms 
are controlled by the estimate in (5.3). In particular, we have 

(6.4) \\p^%,a\\p < ^p'"'/V-3/2 < q /-y^n^l^ < 1 . 
Hence (6.2) follows. 

The second distributional inequality is a well known consequence of the norm inequality. 
Namely, one has the inequality below, valid for all x: 

F(pf f > x) < CY^^x-P , l<p< cn^^ . 
If X is as in (6.3), we can take p - to prove the claimed exponential squared bound. □ 

We shall now use the Beck Gain to prove the crucial estimate (4.10) of Lemma 4.8. We 
actually need a slightly more general inequality: 

Lemma 6.5. We have the following estimate: 

(6.6) sup E P|(l + pFtf < expia'q^^) . 

Vc{l,...,q] j,gy 
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The supremum over V will be an immediate consequence of the proof below, and so we 
don't address it specifically. 

Proof of (4.10). Let us give the essential initial observation. We expand 
E 11(1 + pFjf = E + 2pFj + (pFjf) . 

7=1 7=1 

Hold the last d - 1 coordinates, X2, . . . , Xd, fixed and let !F be the sigma field generated by 
fi,...,f,_i. We have 

7^ ie(i + 2pF, + {pF,f \r) = \ + ^[(pF,f I r) 

= i+flV'"' + p'E(Og,g,i|:r), 

where ^q,q,i is defined in (5.1). Then, we see that 

q q-1 

E + 2pF, + (pFtf) = e{[](1 + 2pFt + (pF,f) x e(i + 2pF, + (pF,f \ t)] 

11=1 -0=1 

q-1 

(6.8) < (1 + aY~')^ + 2pFt + ipFtf) 

v=l 
q-1 

(6.9) + ■ + 2pFf + {pF,f) 

D=l 

This is the main observation: one should induct on (6.8), while treating the term in (6.9) as 
an error, as the Beck Gain estimate (5.3) applies to it. 

Let us set up notation to implement this line of approach. Set 

V 

N(y;r):=||P](l + pff)|[, V = l,...,q. 

t=i ^ 

We will obtain a very crude estimate for these numbers for r = 4. Fortunately, this is 
relatively easy for us to obtain. Namely, q is small enough that we can use the inequalities 
(6.2) to see that 



N(y;4)<]^||l + pFt||4y 



v=l 

All+h\V 



<{1 + Cq"^^y 

We have the estimate below from Holder's inequality 
(6.10) N{V-2{1 - II q)-^) < N(y; 2)1-1/'? •N(y; 4)1/"?. 
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We see that (6.8), (6.9) and (6.10) give us the inequality 

N{V + l;2f < (1 + aY~')mV}2f + C ■ N{V;2{1 - l/q)-'f ■ ||p20y+i,y+i,i||, 

(6.11) < (1 + aY-')N{V;2f + CNiV;2f-^'^ ■ NiV-Af''\\p'Ov+^,v^i,i\\, 

< (1 + a^q^''-^)N{V;2f + Cq^+^n-^'^N{V;2f-^^^ . 

In the last line we have used the inequality (5.3). Of course we only apply this as long as 
N{V; 2) > 1. Assuming this is true for all V > 1, we see that 

N{V + l-2f < (1 + a^q^^-^ + Cq^+^n-^'^)N{V;2f . 
And so, by induction, 

N{q;2) < (1 + aY~' + Cq'^^n^'y^ < e'"''"' . 

Here, the last inequality will be true for large n, provided that e in the definition of (4.1) 
is small. Indeed, we need 

a q > Lq n ' 

Or equivalently. 

Comparing to the definition of g in (4.1), we see that the proof is finished. □ 

One should notice that the results of this section suggest that our methods give a gain 
of the order j. 

7. The Beck gain with fixed parameters. 

We will need to analyze longer products of r functions. These longer products will 
be reduced to the case of a a slightly more general version of the Beck Gain Lemma 5.2. 
Namely, we will consider sums of products of two r fucntions, but impose the additional 
restriction for some coordinates in the pair of vectors to have fixed values. Let a G and 

h G ]N^2 be integer vectors with lengths \a\,\b\ < n. We will be estimating the quantity: 



(7.1) S(fi,f2)= sup sup||SumProd(B)||p, d,n,p>3. 

a,b,h<h ^ 

The inner supremum is formed over aU B c ]H„ x ]H„ and all r functions subject to these 
conditions: 

• re Ayj, s*G Aj^, where ji < j2 (i.e. Si is the maximum in the first coordinate.) 

• There is a coincidence in the second coordinate: For all (r, s) G B, we have r ^ s^and 

r2 = S2. 

• For = 1, . . . ,f 1, we have Tk+i = ci-k- (fi coordinates of r are fixed.) 

• For k = 1, ...,f2, we have SF^+k+2 = bk- (f2 coordinates of s*are fixed, and these 
coordinates are distinct from the other vector.) 



SMALL BALL INEQUALITY IN ALL DIMENSIONS 



23 



We have the following estimate, which gives an average Beck Gain of n^^^ for each of 
the two functions in the product. 

Lemma 7.2. We have the inequality below valid for all dimensions d>3. 



S(Fi,f2)<p' 



Mi 2 in 2 4 



p, n > 3 . 



Proof. We will reduce this situation to the Beck Gain proven before. Let B be as above. 
First of all, we shall apply the Littlewood-Paley inequality in the first coordinate. Notice 
that the maximum in this coordinate is automatically Si. 



(7.3) ||SumProd(B)||p < ^Jf 



celL 



(r,s)eB 

Si=C 



1/2 



p/2 



We concentrate on the latter term, and in particular expand the square. 



(7.4) Vp 



(7.5) 



cel 



,2 ('-,S)6B 
Si=C 



1/2 



p/2 



= VP 



Yu frfrfrft 



Si=S, 



1/2 
p/2 



< ^fpn max 



fr'fs'ff' fs 



{r,s,r,s)eBxB 
S] =s J ; r2 =S2 =c; £2 =£2 =£ 



(7.6) + ^/p^fii max ^ fr fr fr fi 

(r,s/,DeBxB 

We start with the estimates for the first term above (7.5): 



1/2 
p/2 
1/2 
p/2 



\/pn max 

etc 



fr'fs'fr' fs 



(r,s*,r,|)eBxB 
si =s J ; )-2 =S2 =c; £2 =£2 =£ 



1/2 



p/2 



= ^fpn max 



< -Jvnmax 



1/2 



p/2 



1/2, 



L /^-/r II L /^-/l 

(r,r)6Bi P (s',DeB2 



1/2 



Here Bi is defined to consist of pairs (r, f) e which satisfy the following: 



• For A: = 1, . . . , f 1, we have rk+2 = L 

• r2 = c,r^ = c. 



Lk+2 



ak. 
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And similarly B2 consists of pairs {s,s) G A^^ with the properties: 

• For = 1, . . . ,f 2, we have Sk+F,+2 = h+F^+i = 

• S2 = C, £2 = c. 

• Moreover, we have Si = S-^. 

Notice that because of the last condition and the fact that ci^ c (i.e., si^s), the Beck Gain 
(Lemma 5.2) applies to this family of pairs, giving a gain of n^''^, while Bi will be estimated 
by simple parameter counting, supplying no gain. We have 

||SumProd(Bi)||p < (pn)''-2-f^ 

||SumProd(B2)||p < p'i-3/2-F2„d-2-F2-l/2 _ 

And thus we can estimate the term (7.5) by 

1/2 



Vpn 



{r,s,i,s)eBxB 
Si=s,;r2=S2=c; r. 



ipny 



The second term (7.6) satisfies the same bound in n. This can be shown by simple 
parameter counting, the gain comes from the loss of one parameter since c = c. 

We remark that in this version of the Beck gain 'error terms' do not arise, since we apply 
Littlewood-Paley inequality only in the first coordinate, where we already have a natural 
order. Thus we do not need to use the conditional expectation argument as in the proof of 
Lemma 5.2. 

□ 



8. The Beck Gain for Longer Coincidences 

In the present section we treat longer coincidences. This requires a careful analysis of 
the variety of ways that a product can fail to be strongly distinct. That is, we need to 
understand the variety of ways that coincidences can arise, and how coincidences can 
contribute to a smaller norm. Following Beck, we will use the language of Graph Theory 
to describe these general patterns of coincidences. 

Graph Theory Nomenclature. We adopt familiar nomenclature from Graph Theory, al- 
though there is no graph theoretical fact that we need, rather the use of this language is 
just a convenient way to do some bookkeeping. The class of graphs that we are interested 
in satisfies particular properties. A d-1 colored graph G is the tuple {V{G), E2, E3, . . . , Ed), 
of the vertex set V{G) c {!,..., q}, and edge sets E2, E3, . . . E^, of colors 2,3,...,d respectively. 
Edge sets are are subsets of 

Ej c V{G) X V{G) - {{Kk) I k e V{G)} . 

Edges are symmetric, thus if {v,v') G Ej then necessarily {v',v) G Ej. 
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A clique of color j is a maximal subset Q c V{G) such that for allv ^ v' e Q we have 
{v, v') £ Ej. By maximality, we mean that no strictly larger set of vertices Q' d Q satisfies 
this condition. 

Call a graph G admissible iff 

• The edges sets, in all - 1 colors, decompose into a union of cliques. 

• If Qk's are cliques of color k{k-l,...,d), then 0^=2 Qfc contains at most one vertex. 

• Every vertex is in at least one clique. 

A graph G is connected iff for any two vertices in the graph, there is a path that connects 
them. A jjath in the graph G is a sequence of vertices Ui, . . . , Ujt with an edge of any color, 
spanning adjacent vertices , that is {Vj,Vj+i) G Wj.^^E^. 

Reduction to Admissible Graphs. It is clear that admissible graphs as defined above are 
naturally associated to sums of products of r functions. Given admissible graph G on 
vertices V, we set X(G) to be those tuples of r vectors 

veV 

SO that if {v, v') is an edge of color / in G, then r^,,; = r^',;. 

We shall introduce the following counting parameter: for an admissible graph G, its 
index, ind{G), is defined as 

(8.1) ind{G)= (i^Q-l)- 

Q is a clique 

Effectively, the index of G is the least number of equalities, needed to define X(G), in other 
words, the number of coincidences. In particular, for the graphs, corresponding to the 
simplest case of the Beck Gain, the index is one. 

With these definitions at hand, it is not hard to obtain the Inclusion-Exclusion formula, 
relating admissible graphs and the 'not strongly distinct' part of the Riesz product: 

(8.2) = Yj (-1)'""^*^'^^ p SumProd(X(G)) • H + P^*^' 

G admissible ttV{G) 

We will prove the following Theorem: 

Theorem 8.3. Beck Gain for Graphs For an admissible graph G on vertices V we have the 
estimate below for positive, finite constants Co, Ci, C2, C3; 

(8.4) pl^l||SumProd(X(G))||p < [Co\Vf'p^'q^'n-'^f^ , 2 < p < 00 . 

The most significant term on the right is n~^. It shows that as the number of coincidences 
goes up, the corresponding 'Beck Gain' improves. Notice that for the other terms on the 
right, Co is a constant; \ V\ < q <n'^, where we can choose < e as a function of r{; and while 
the inequality above holds for all 2 < /? < 00, we will only need to apply it for p <q^ < n^^'^. 
That is, the is the dominant term on the right. This TTieorem, together with the fact that 
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there are at most |yp'*l^l admissible graphs on the vertex set V , yields the boundedness of 
the sum in (4.16). 

Norm Estimates for Admissible Graphs. We begin the proof of Theorem 8.3 with a 
further reduction to connected admissible graphs. Let us write G G BG(Co, C\, Ci, C3, r]) if 
the estimates (8.4) holds. ('BG' for 'Beck Gain.') We need to see that all admissible graphs 
are in BG(Co, C\, C2, C3, r\) for non-negative, finite choices of the relevant constants. 

Lemma 8.5. Let Co, Ci, C2, C3, r\ he non-negative constants. Suppose that G is an admissible 
graph, and that it can he written as a union of subgraphs Gi, . . . ,GkOn disjoint vertex sets, where 
all Gj e BG(Co, Ci, C2, C3, t]). Then, 

GGBG(Co,Ci,C2,C2 + C3,r]). 

With this Lemma, we will identify a small class of graphs for which we can verify the 
property (8.4) directly, and then appeal to this Lemma to deduce Lemma 8.3. Accordingly, 
we modify our notation. If ^ is a class of graphs, we write Q c BG(r]) if there are constants 
Co, Ci, C2, C3 such that c BG(Co, Ci, C2, C3, ?]). 

Proof. We then have by Proposition 8.6 

k 

SumProd(X(G)) = Y[ SumProd(X(G/)) . 

;=i 

Using Holder's inequality, we can estimate 

k 

p"^l||SumProd(X(G))||p < [] pl^/'||SumProd(X(Gy))||,p 

7=1 
k 

< YliCoikpf'q^'n-'^f'^ 

7=1 

< [Cop^l^?^^+^^n-'']l^l. 

Here, we use the fact that since the graphs are non-empty, we necessarily have k < q. 

□ 

Proposition 8.6. Let Gi, . ..,Gpbe admissible graphs on pairwise disjoint vertex sets Vi, . . .,Vp. 
Extend these graphs in the natural way to a graph G on the vertex setV=[j Vt. Then, we have 

p 

SumProd(X(G)) = Yl SumProd(X(GO) • 

f=i 
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Connected Graphs Have the Beck Gain. We single out for special consideration the con- 
nected admissible graphs G . Let ^connected be the collection of of all admissible connected 
graphs on y C {1, . . . , q}. 

Lemma 8.7. We have ^connected c BG(7]) /or some 7] > 0. 

The point of this proof is that we will reduce this question to a much simpler key fact, 
namely Lemma 7.2, which we restate here in our current notation.'^ 

Let ^fixed(2) be the set of graphs — and sets of r functions associated with the graphs — 
with these properties: 

• G is a connected graph on two vertices {v,v'}. That is, there is at least one edge 
that connects these to vertices. Denote by C c {2,...,d} the set of coordinates 
corresponding to the edges. 

• There are a set of coordinates f f i,/ c {2, . . . , rf} that are disjoint from the set of 
edges, and two vectors ae N^'" and a' G N^"', so that we define 

Y(G) := {(r^, r^') e ]H„ : r^,,; = Tj^'^j 'Vj eC; r^,k = ak^k&Fy; r^^^j, = ak^k& F^^} 

These are in essence the assumptions of Lemma 7.2. This Lemma proves that 

||SumProd(y(G))||p < p^n° , o = d-l- - i . 

By abuse of notation, let us summarize this inequality by the inclusion ^fixed(2) c 
BG(Co, Ci, rf/2, 0, 1/8). Or, even more briefly, as ^fixed(2) c BG(l/8). That is, there is a gain 
of I for each vertex. It follows from the proof of Lemma 8.5, that if G is any graph whose 
connected components are each elements of ^fixed(2), then G G BG(l/8). 

Our line of attack on this Lemma is to take a general connected graph G, use the triangle 
inequality to assign fixed values to a number of edges, making the connected components 
of the new graph to be elements of ^fixed(2). The proportion of vertices that will be in one 
of these graphs will be at least 1 /2rf of all vertices. And therefore connected graphs will be 
inBG(l/16d). 

Remark 8.8. A heuristic guides this argument. The normalization p'^' in (8.4) assigns a 
weight n"^^^ to each free parameter of X(G), ignoring losses of parameters from the edges 
of G. If {v, v') is an edge in the graph, and we assign the edge one of n possible values, the 
full power of n is exactly compensated by the collective weight of the two parameters in 
the edge. Therefore, we are free to fix a fixed proportion of edges in the graph, obtaining 
a Beck Gain on the remaining proportion. In this argument, if the edge is in a clique of 
size at least k>3, specifying a single value on this clique actually leads to a positive gain 
of n'^l'^*'^. In other words, graphs, all of whose cliques are of size two, are extremal with 
respect to this analysis (see Lemma 8.9). This heuristic is made precise in the proof below. 

The only points that recommend the proof we describe here is that it is easy to state and delivers a gain. 
Clearly, a more sustained analysis, yielding a larger gain would result in an improved result on the Small 
Ball Conjecture. 
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By 'deleting a clique' we shall mean fixing a value of the coincidence which corresponds 
to that clique. Let G G ^connected- Following the heuristic above, in the first step of the 
algorithm we delete all cliques of size at least 3 in G. 

After this step G breaks down into connected components, which are admissible graphs 
with cliques only of size 2 (and, possibly, some singletons). Next, we want to obtain an 
estimate for such graphs. 

Lemma 8.9. Suppose G e ^connected has cliques of size at most 2. Then G e BG(^). 

To prove this statement we shall use the following property of G: 

• The degree of each vertex in G is at most d - 1 (since the degree in each color is at 
most one). 

Let V be the set of vertices of G, and E be the set of all its edges. The point is to select a 

maximal subset Ejndpndt of independent edges. That is, no two edges in Emdpndt/ regardless 
of color, have a common vertex. It is an elementary fact that we can take 

(8.10) |£indpndtl>2i3|£|. 

Indeed, each edge in G shares a vertex with at most 2d -4 distinct edges, which observation 
directly implies the inequality above. 

We delete all other edges of G (i.e. we fix some choice of parameters for the corresponding 

coincidences) and thus G breaks down into a number of components each of which is either 
a singleton or a graph with two vertices and one edge. The latter components correspond 
exactly to the situation in which the Beck gain of the previous section is applicable. Let 

us denote these pairs by G^ e ^fixed(2), k = 1,...,N = |£mdpndtl; the singletons - by Vj, 

i = 1, . . . , |y| - 2N. Let also E' - E - Eindpndnt denote the set of all deleted edges in G. 
Denote also by the number of fixed parameters in X(Gfc) and F'j will be the number of 

fixed parameters in f\,.. We have the following relations: 

_ _ N |y|-2N 

(8.11) 2|£'|=2|£-£indpndntl = 2],ffc+ Z ^'j' 

k=l j=\ 

and, since G is connected, it has at least \V{G)\ - 1 edges, thus 



(8.12) N> 



|E| 



1^1-1 
> — > 



1^1 



> 



2A-3~ 2A-3 ~ 2{2A - 3) ~ 4rf 
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Besides, by Proposition 8.6, we obtain the following equality (the sum below is taken over 
all choices of parameters on the 'deleted' edges): 



N \V\-2N 



(8.13) SumProd(X(G)) = J], H SumProd(X(GO) • f| SumProd(X(z;y)) . 

k=l 7=1 

Now we apply the triangle inequality. Holder's inequality, the relations (8.11) and (8.12), 
and the Beck gain in the form of Lemma 7.2 to estimate (k = \V\-N < q): 



N \V\-2N 

nWf^ II 

Kp 



pl^l||SumProd(X(G))||p < n'^'l ■ JJ p'\\SumFrod{X{Gk))Lp " [] '""-^'''■A'. 

k=l i=l 



N \V\-2N r 



F'. 

p{Kpn) 2 2 



k=l 7=1 L 

[_ sti d-\\V\ _N Y izl i 1 il^l 
Cp ^ -n i < \p 2 q2 n ^ 

This proves Lemma 8.9. The point of passing to the collection of independent edges is 
that SumProd(X(G)) splits into a product of terms associated with graphs in ^fixed(2). Each 
of these graphs leads to a gain of at least | for each vertex. But by (8.11), there are at least 
^|y(G)| vertices for which we will get this gain. This shows that G G BG(l/16rf). 

We can now proceed to prove Lemma 8.7 - the proof will be in the same spirit. After 
we delete "large" (of size at least 3) cliques of G, this graph decomposed into some 
singletons and components as in Lemma 8.9 (but with some parameters fixed). Denote 

these components by Gk, k = 1, . . . , ni and the singletons by Uj, j = 1, . . . , n2. Let fk be 

the number of fixed parameters in X(G)t) and and f- - the number of fixed parameters in 

Tuj- Notice that the proof of Lemma 8.9 can be trivially adapted to the case when some 
parameters are fixed to obtain the estimate: 

\Vk\ _fk 



(8.14) pl^'l||SumProd(X(GO)||p < [Cp^q^rT^] ' n 



Also, if we denote by K the total number of fixed cliques, one can see that, since all the 
cliques had size at least 3, we have the inequality: 



"1 "2 



(8.15) ?,K<Y,fk + Yuf'i- 

k=l 7=1 

Let us write the set of vertices of G as V = Vi U V2, where Vi are the vertices involved in 
at least one of the deleted cliques and V2 are all the other vertices. It is easy to see that 

V2 c U'^l^V{Gk). Indeed, all the vertices that became singletons had to be a part of one of 
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the deleted cliques. Thus, 



111 



(8.16) \V2\<Y,MGk)\. 
Besides, it is easy to see that 

111 112 

(8.17) \Vi\<Y,f^ + Yjf'i' 

k=l /=1 

because at least one parameter is fixed in each vertex from a deleted clique. Using these 
relations, similarly to the proof of Lemma 8.9, taking k = tii + n2 < q, we can write: 



111 "2 

Kp 



pl^l||Prod(X(G))||p < • n p"'^''^^'llProd(X(G,))|k.p • p||/^,^ 

k=l /=1 



«i — i2 r f 



k=i y=i 



p 2 q n 2 



9. The Lower Bound on the Discrepancy Function 

We give the proof of Theorem 2.4, which is essentially a corollary to the proof of our Main 
Theorem, Theorem 1.5. As such, we will give a somewhat abbreviated proof. Indeed, the 
analogy between the lower bound on Discrepancy Functions and the Small Ball Inequality 
is well known to experts. 

Theproof is by duality. Fix N, and take 2N < 2" < 4N. It is a familiar fact (Beck and Chen, 1987) 
that for each \r\ = n we can construct a r function ff such that 

(9.1) {Dn,/?)>c>0, 

where c depends only on dimension. We use these functions in the construction of the test 
function, following § 4, with this one change. Before, see (4.3), we took li,.. . ,lci to be a 
partition of {1, 2, ... , n] into q disjoint intervals of equal length. Instead, we take 

(9.2) 7f :={;gN : \j - tnlq\ < ql^ . 

This is the only change we make in the construction of M^®*^. It follows that < 1. 

Recall that W^'^ = Ej^^ , see (4.5). By construction, we have 

t=l reAt 
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> ^h^(d-l)l2 ^ ^e/4+(rf-l)/2 _ 

This is a 'gain over the average case estimate' as one can see by comparison to Theorem 2.2. 
It remains to see that the higher order terms W^'^ contribute smaller terms than the one 
above. 

By construction, is itself a sum of r functions /^with |s1 > n. Indeed, it follows from 
the separation in (9.2) that we necessarily have 

(9.3) n + kf^<\s\<nd. 

Second, it is a well known fact that KDn,/^)| < N2"''^. Third, we fix s*as above, and set 
Count(s) to be the number of distinct ways can we select ri, . . . , r^, all of length n, so that 
the product /;?■•■ /^^ is an r function of parameter s. A very crude bound here is sufficient, 

Count(s) < \s\^^-^^^ . 
Thus, we can estimate 

j>n+k^ V- Is1=/ 
< ^d(k+3)2-kn/2q _ 

As q = rf , this is clearly summable in > 1 to at most a constant. This completes the proof. 

10. The Proof of the Smooth Small Ball Inequality 

We prove Theorem 2.7. There is no loss of generality in assuming that \a{K)\ < 1 for all 
R of volume at least 2"", since both sides of (2.8) are homogeneous and sums have finitely 
many terms. With (p as in the theorem, set 

R:\Rj\=2'''i 

And let O = Y,\p\=n'Pr- Define the r functions as in (3.7). It is the assumption that = 
{(p, /z[-i/2,i/2]) ^ 0, and in fact we will assume that this inner product is positive. Thus, 

(10.1) <(p,^/^) = c^2-" Yj 

R-.lRjl^l'i 

Ascpe C[-l/2, 1/2], we have 

(10.2) \{(p.hj)\<C^\}\ 

for all dyadic intervals /. 
It is important to note that 

(10.3) K^,-,m S oLw^sI' 
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The first line follows from the fact that cp is supported on [-1/2, 1/2], so that if e. g. Si < ri, 
the fact that (p has mean zero proves this estimate. The second estimate follows from (10.2) 
and the assumption that the coefficients a{R) are at most one in absolute value. 
Let us take the intervals It in (9.2), and let us assume that 

(10.4) Yj^a{R)\<4:Y^Yj Tj I^^^^I- 

|R|=2" f=l reA,R:|R.|=2"'7 

If this inequality fails, it is an easy matter to redefine the It so that the inequality above is 
true, and adjacent intervals It,It+i are seperated by n/q. 

We then follow § 4 as before to define our test function W^'^. It follows that < 1. 

Using (10.4), (10.1) and (10.3), we have 

f=l reAt R:lRj\=2''i 
|R|=2-" 

This is the main term. 

It remains to see that the inner products KO,^f )| are small k > 1. The details of this 
calculation are very similar to the corresponding calculuations in the previous section, 
hence they are omitted. 
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